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Sir; 

T, MAURO MAGNANI, Ph_D., do declare as follows: 

K T am currently Professor of Biochemistry, Director of the section 
Biotechnology, Vice-Rector of the University of Urbino, Urbino, Italy, from 2001 to 
September 24, 2009. I have over thirty years of experience as a biochemist in the research 
and development of products and applications useful in the biotechnology and 
pharmaceutical industries. I am included in the official list of professional biologists in Italy 
with n. 01 7484 "Ordine Nazionale Biologi," and I am a Technical Director nominated by the 
"Agenzia Italiana del Farmaco, AIFA" with n. AIDT- 19/2005. My education and experience 
are summarized on my Curriculum Vitae, which is attached hereto as Exhibit 1 , 

2. I have collaborated with Dr. Barbara Ensoli, who is the inventor of the above- 
identified application No. 09/555,534 (hereinafter "the '534 application"), in the development 
of methods to produce recombinant, biologically active HTV Tat protein. T also supervise and 
have supervised the production of such biologically active HIV Tat protein, according to 
good manufacturing practices (GMP), for use in human clinical trials. In particular, I have 
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experience in reversed-phase high pressure liquid chromatography (RP-HPLC) purification 
of biologically active HIV Tat protein, which my laboratory has carried out many times. 

3. This Fourth Declaration is intended to supplement the Third Declaration of 
Mauro Magnani, Ph.D. Under 37 C.F,R. § 1.132 ("Third Magnani Declaration") filed January 
8, 2009 for the *534 application. In particular, this Fourth Declaration provides further 
evidence that one skilled in the an as of December 1, 1997 would understand that an acid 
exchange reaction such as that described in U.S. Patent No. 5,646,120 by Sumner-Smith et ai 
("Sumner-Smith et al") would be expected to destroy the biological activity of an HIV Tat 
protein purified by RP-HPLQ even if the acid exchange reaction is performed before a 
Iyophilization and resuspension step such as that described in Chang el aL, "HIV-1 Tat 
protein exits from cells via a leaderless secretory pathway and binds to extracellular matrix- 
associated heparin sulphate proteoglycans through its basic region," AIDS, 1997, 

1 1(12):1421-143 1 ("Chang el al"). Indeed, as discussed below, it is the performance of the 
acid exchange reaction and consequent change of pH in an aqueous environment that 
promotes loss of biological activity of the HIV Tat protein, regardless of whether the acid 
exchange reaction occurs just after RP-HPLC or just after lyophilization and resuspension. 

4. As disclosed in the specification of the '534 application (see page 24, lines 21- 
25), the conformation and biological activity of an HIV Tat protein is strongly affected by the 
protein's cysteine rich region. When oxidized, the cysteine residues of the HTV Tat protein 
form intra- and inter-molecular disulfide bonds, thus modifying the conformation of the HIV 
Tat protein, causing aggregation and loss of biological activity. At an acidic pH, the HIV Tat 
protein is reduced and in a random coil conformation; as the pH increases, the protein tends 
to change its conformation, forming a non-biologically active structure stabilized by disulfide 
bonds. 

5. Sumner-Smith e( ah discloses treating a column-eluted oligopeptide to 
exchange the cleavage acid, such as trifluoracetic acid (TFA), with a pharmaceutically 
acceptable acid, such as acetic acid, to provide a water soluble salt of the peptide (see col. 9 ? 
lines 57-61 ). Such an acid exchange reaction, exchanging TFA for an acid like acetic acid, I 
understand to be a common practice for peptides, but not for proteins. The reason for this is 
that in general, proteins must retain their three-dimensional conformation in order to retain 
their biological activity, whereas short peptides often are not reliant on their three- 
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dimensional conformation for their biological activity or can recover their biological activity 
more easily than larger proteins. Acetic acid, which has a pKa of 4.76, is much less acidic 
than TFA, which has a pKa of 0.3. A person skilled in the art would understand that even a 
small increase in pH could cause a significant change in a protein's hydrogen exchange rate 
with water and changes in the protein conformation and biological activity. Indeed, a simple 
change in pH from 3.3 to 5.8 causes an increase in the hydrogen exchange rate with water on 
the order of 250-fold for an HTV Tat protein, as determined by NMR spectroscopy, with 
consequent appearance of local conformation (see Shojania ei aL t "HIV-1 Tat is a natively 
unfolded protein," J Biol, Chem., 2006, 281(13):8347-8356 > at the paragraph spanning pages 
8354-8355, copy of which is attached as Exhibit 2). One would expect that, under acid 
exchange conditions, the presence of a reducing agent would be necessary, although not 
necessarily sufficient, for the HIV Tat protein to maintain a non-oxidized, non-aggregated 
conformation and thus biological activity. Accordingly, if an HIV Tat protein is subjected to 
the acid exchange reaction of Sumner-Smith et al 9 without the presence of a reducing agent, 
it is expected that the HIV Tat protein would lose biological activity. Moreover, even in the 
presence of a reducing agent (which is not taught by Sumner-Smith et al \ there is no way to 
predict whether, and thus no expectation that, the biological activity of the HIV Tat protein 
would be preserved. Thus, in my judgement and opinion, if the acid exchange reaction 
described by Sumner-Smith et a/, were to be applied to an HIV Tat protein purified by the 
RP-HPLC method of Chang et aL , the change in the pH of the HIV Tat protein in an aqueous 
environment would not be expected to preserve the biological activity of the HIV Tat protein. 
A subsequent lyophilization step, as disclosed in Chang et aL ? does not entail any steps that 
would be expected to restore the native conformation and thus biological activity of the HIV 
Tat protein- 
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I hereby declare that all statements made herein of my own knowledge are true 



and that all statements made on information and belief are believed to be true; and further that 
I make these statements with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both s under Section 1001 of Title 18 of the United 
States Code, and that such willful false statements may jeopardize the validity of this 
application, and any patent issuing thereon. 





Mauro Magnani, Ph D. 
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EXHIBIT 1 



Curriculum Vitae - Prof. Mauro Magnani 



MAGNANI Prof. Mauro, Ph.D. Italian, Professor of Biochemistry 
BORN: April 9, 1953, Italy 
LANGUAGES: Italian, English 
EDUCATION: Univ. Urbino, Italy, Ph.D., 1976 

PRIMARY POSITION: Professor of Biochemistry and Director Centre of Biotechnology. 
PROFESSIONAL CAREER: Visiting Researcher, Dept. Biochemistry, Univ. Birmingham, 
1980; Visiting Prof. Dept. Biolgy, Haifa, Israel, 1983; Asst. Prof. Univ. Urbino, 1977-82, 
Assoc. Prof., 1982-1986, Prof. 1986 - ; Dean, Faculty of Sciences University of Urbino 1995- 
2001; Director Interuniversity Consortium for Biotechnology (CIB) 1998-2004; Vice Rector 
of the University of Urbino 2001- Sept. 24, 2009 .Include in the official list of professional 
biologist in Italy with n. 017484 "Ordine Nazionale Biologi". Technical Director nominated 
by the "Agenzia Italiana del Farmaco, AIFA" with n. AIDT- 19/2005. 

CURRENT RESEARCH: Development of new drug delivery and drug targeting systems; 
Protein turnover ubiquitination and regulation of gene expression; Mechanisms of drug 
resistance and drug toxicity; Modulation of NF-kB and gene expression by oligonucleotide 
decoys, vaccine development; nanobiotechnology in drug delivery. 

PUBLICATIONS: over 350 articles published in international refereed scientific journals; 
Co-editor of three books: 

"Red Blood Cell Aging", Plenum Press, N.Y., 1991, pp. 383. 

"The Use of Resealed Erythrocytes as Carriers and Bioreactors", Plenum Press, N.Y., 1992, 
pp. 361. 

"Erythrocyte Engineering for Drug Delivery and Targeting", Landes Bioscience, 2002. 
REFEREE: Programmes of the E.U.; The International Science Foundation (U.S.A.); Target 
Project "Biotechnology" of the National Research Council (C.N.R.).; Member of the Project 
"Patologia clinica e terapia delPinfezione da HIV" of the Italian Ministry of Health; PRIN 
and FIRB Projects of Italian Ministry of University and Research; Member of Committee 
Post Genoma (C.N.R); Include in the "Albo degli Esperti" of M.I.U.R. and Eureka Projects of 
EU. 

REVIEWER: Biotechnology and Applied Biochemistry; Nature Biotechnology; Drugs; 
Leukemia; European Journal Haematology; Biochimica et Biophysica Acta; Blood; Journal of 
Cellular Engineering; Journal of Internal Medicine; Journal of Acquired Immune Deficiency 
Syndromes and Human Retrovirology; Mechanisms of Ageing and Development; Antiviral 
Research; Journal of Chromatography; Journal of Biological Regulators and Homeostatic 
Agents; Life Sciences; Biochemistry; International Journal of Biochemistry and Cell Biology; 
Human Gene Therapy; European Journal of Biochemistry; Clinical Pharmacokinetics; 
Autoimmunity; Oncogene; Haematologica; J. Controlled Release; Editorial Board: Current 
Drug Targets, Biotechnology. 

PATENTS 

Europen Patent EP 0517986B1 

M. Magnani, L. Rossi "Transformed erythrocytes, process for preparing the same, and their 
use in pharmaceutical compositions" 



US Patent 5,753,221 



M. Magnani, L. Rossi "Transformed erythrocytes, process for preparing the same, and their 
use in pharmaceutical compositions" 

US Patent N. 6.139.836 

Mauro Magnani, Ivo Panzani, Leonardo Bigi, Andrea Zanella "Method of encapsulating 
biologically active agents within erythrocytes, and apparatus therefor". 
Assignee: Dideco S.p.A., Mirandola, Italy 

European Patent N. EP98830479.6 

M. Magnani, G. Brandi, A. Fraternale, A. Casabianca "Pharmaceutical composition or 
composition package containing a pyrimidine nucleoside analogue and a purine nucleoside 
analogue 99 . 

Brevetto C.N.R. N. RM92 A 000377 

M. Magnani "Antigeni legati alia superflcie esterna di eritrociti e procedimento per la loro 
preparazione" 

Brevetto C.N.R. N. RM 93 A 000474 

M. Magnani "Eritrociti incorporanti alcool ossidasi e loro uso nelle intossicazioni da 
metanolo " 

Brevetto C.N.R. 

M. Magnani, L. Rossi, G. Brandi, E. Millo, G. Damonte, U. Benatti, A. De Flora 

"Profarmaco di acyclovir e suo uso in composizioni farmaceutiche" 

Brevetto di Invenzione N. MI2002A01 196 - 06/06/1996 - PCT/IT 02/00368 del 13/06/2002 
M. Magnani, C. Fiorucci, P. Filippone, G. Brandi, M. Paiardini. "Derivato tetramerico 
delVindol-3 carbinolo ad attivitd anticancerogena e metodo di sintesi del derivato stesso". 

Brevetto di Invenzione N. TO2001 A01077 - 16/1 1/2001 

M. Magnani, F. Graziano, A. Ruzzo "Mutazioni della linea germinale nel promotore del gene 
della E-caderina e metodi di diagnosi per individuare una maggiore suscettibilita al 
carcinoma gastrico ". 

Brevetto N. TO2003A001048 - 30/12/2003 - PCT/EP/2004/053726 - 29/12/2004 
U.Benatti, G. Brandi, E. Garaci, M. Magnani, E. Millo, A.T. Palamara, L. Rossi. "Derivati del 
glutathione e loro utilizzo per il trattamento di malattie virali". 
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Supplemental Material can be found at: 
http://www.jbc.org/cgi/content/full/M510748200/DC1 
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THE SOLUTION CONFORMATION AND DYNAMICS OF REDUCED HIV- 1 Tat~(1-72) 
BYNMR SPECTROSCOPY*® 
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Shaheen Shojania and Joe D, CXNeil 1 

From the Department of Chemistry, University of Manitoba, Winnipeg, Manitoba R3T 2N2, Canada 



Tat (transact! vat or of transcription) is a small RNA-binding pro- 
tein that plays a central role in the regulation of human immunod- 
eficiency virus type A replication and in approaches to treating 
latently infected ceils. Its interactions with a wide variety of both 
intracellular and extracellular molecules is well documented. A 
molecular understanding of the multitude of Tat a ctivities requires 
a determination of its structure and interactions with cellular and 
viral partners. To increase the dispersion of NMR signals and per- 
mit dynamics analysis by multinuclear NMR spectroscopy;, we have 
prepared uniformly J ^N~ and J "°N/ i3 C-labeled. Tat- (1—72) protein. 
The cysteine -rich protein is unambiguously reduced at pH 4.1, and 
NMR chemical shifts and coupling constants suggest that it exists in 
a random coil conformation. Line broadening and multiple peaks in 
the Cys-rich and core regions suggest that transient folding occurs 
In two of the five sequence domains. NMR relaxation parameters 
were measured and analyzed by spectral density and Lipari-Szabo 
approaches, both confirming the lack of structure throughout the 
length of the molecule. The absence of a fixed conformation and the 
observation of fast dynamics are consistent with the ability of Tat 
protein to interact with a wide variety of proteins and nucleic acid 
and support the concept of a natively unfolded protein. 



Human immunodeficiency virus type 1 (HIV-- if- infected resting CD4 + 
memory T -cells form a persistent viral reservoir that is a major barrier to 
curing HIV-1 infection although they do not permit viral replication (1). A 
small, RNA -binding protein. Tat (transactiva tor of transcription) (2), plays 
a central role in the regulation of HIV-1 replication and in potential 
approaches to treating latently infected cells. During transcription of the 
viral DNA, RNA polymerase H stalls as a result of binding the negative 
transcription elongation factor, resulting in the production of prematurely 
terminated transcripts that may include the tat message (3). Following 
translation, the Tat protein is transported from the cytoplasm into the 
nucleus, where it binds a stem-loop structure (transactiva tton response 
element (TAR)) formed by the first 59 nucleotides of the HIV-1 RNA (4). 
Tat stimulates elongation of full-length transcripts by recruiting the posi- 
tive tr anscription elongation factor b, a complex of a regulatory cyclin and 
cycl in -dependent kinase 9, that phosphorylates the C-terminal domain of 

*This work was supported by grants from the Natural Sciences and Engineering 
Research Council of Canada and the University of Manitoba; it was initiated with 
funding from the Medical Research Council of Canada and the Manitoba Health 
Research Council. The costs of publication of this article were defrayed in part by the 
payment of page charges. This article must therefore be hereby marked '''advertise - 
menf in accordance with '18 U.S,C Section '! 734 solely to indicate this fact, 

L5J The on-line version of this article (available at http://www.jbc.org) contains supple- 
ments! rigs. 1S-5S and Table IS. 

■ To whom correspondence should be addressed. Tel,; 204-474-6697; rax: 204-474-7608; 
E-mail: joneii@cc.umanitoba.ca, 

J The abbreviations used are: HIV, human immunodeficiency virus; HSQC, heteronuclear 
single quantum coherence; MES f 2-(/V-rriorpho3ino)ethanesulfonic acid; NOE, nuclear 
Overhauser effect; SW, sweep width; TAR, transact! vat ion response element; CREB, 
c A M P- respo n se e I e men t. 



RNA polymerase II, components of negative transcription elongation fac- 
tor, and the transcription elongation factor Spt5 (5—7), Recent results sug- 
gest that Tat activates positive tr anscription elongation factor b by displac- 
ing Hexim! (hexamethylene bisa.cetam.ide- inducible protein 1) from its 
cyclin Tl binding site (8) and that the affinity of the Tat-cyclin Tl-cyciin- 
dependent kinase 9 complex for TAR is regulated through Tat acetylation 
by histone acetyl transferases (9, 10). The absence of Tat and low levels of 
cyclin -dependent kinase 9 and cyclin IT in resting CD4 " T-cells are both 
implicated in HIV-1 latency (I). Tat may also be involved in derepression of 
heterochromatin, transcription initiation (II), and reverse transcription 
(12). In addition to its intracellular activities, the literature contains a pleth- 
ora of reports of extracellular Tat activities, including general cytotoxicity, 
that may contribute to immune suppression (13, 14) and the development 
of HIV -dementia (15). 

A molecular understanding of Tat activity requires a determination 
of its structure and interactions with cellular and viral partners (16, 17). 
HIV-1 Tat is a 101-residue protein encoded by two exons (3, IS). The 
first exon defines amino acids 1—72 that encompass an acidic and pro- 
line-rich N terminus (amino acids 1—21), a cysteine -rich region (amino 
acids 22—37), a core (amino acids 38 — 47). a basic region (amino acids 
48 -57), and a Gin-rich segment (amino acids 58—72) (19); it activates 
transcription with the same proficiency as the full-length protein (3, 
20—22). Residues 1—24 form the co-activator and acetyltransferase 
CREB-binding protein KLX domain binding site (17), Cyclin Tl is 
thought to interact with the Cys-rich region of Tat (23), and mutation of 
anyone of six of the seven Cys residues results in inactivation (3). The 
end of the Cys-rich region and the core are involved in mitochondrial 
apoptosis of bystander nomnfeeted cells through their ability to bind 
tubulin and prevent its depolymerization (24). The basic region is 
important for FAR binding (25) and nuclear localization, and the seg- 
ment between Tyr-47 and Arg-57 has been used to transport a large 
variety of materials, including proteins, DNA, drugs, imaging agents, 
liposomes, and nan op articles across cell and nuclear membranes (26). 
The Gin-rich region has been implicated in mitochondrial apoptosis of 
T-cells (27). The second tat exon defines residues 73—101 and includes 
an RGD motif that may mediate Tat binding to cell surface integrins 
(28) . Despite numerous attempts, the biological function of the second 
exon-encoded polypeptide has been difficult to determine (12, 22). 

The Tat ammo acid sequence has a low overall hydrophobic* ty and a 
high net positive charge, and analyses by several algorithms (29, 30) 
suggest that it is a natively unfolded protein (31, 32) with a possible 
folding nucleus between residues 42 and 75; CD spectropolarimetry 
confirms the lack of secondary structure (17). Proteome analysts sug- 
gests that about 33% of eukaryotic proteins are natively unfolded or 
contain long stretches of unfolded polypeptide, whereas unfolded pro- 
teins are much less abundant among the archaea and eubacterta (5%) 
(33). One advantage that unfolded proteins have over folded proteins is 
that they provide a much larger surface area, enabling multiple interac- 
tions with other molecules (34). The most common types of natively 
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Conformation and Dynamics of HIV- 1 Tat-( 1-72) 

unfolded protein in yeast are transcription, factors and transcription 
regulators as well as kinases (35), and DNA. is the most common target 
bound by unfolded proteins. A common mechanism of action for 
natively unfolded proteins involves partial or complete folding upon 
interaction with a binding partner (31, 32). Areas of residual structure in 
unfolded proteins can be critical to forming the initial site of interaction 
with a binding partner (32), and delineation of those sites is required for 
the design of drugs and antibodies that interfere with binding. 

There have been several attempts to determine solution conformations 
of Tat and its segments both alone and in complexes. 1 H NMR spectros- 
copy and molecular dynamics simulations suggested that Tat-(1— 86) 
(Z-variant) forms condensed domains encompassing the core and Gin-rich 
regions, whereas the basic and Cy$-rich regions were found to be highly 
flexible at pi t 6.3 under reducing conditions (36). In a model of the 87- res- 
idue Tat Ma! protein at pH 4.5 under oxidizing conditions, the N- terminal 
Trp-Il forms a hydrophobic core through interactions with Phe-38 and 
Tyr-47 (37). The basic region is in an extended conformation, and the 
Cys-rich region contains j3- turns; a -helix is found in the Gin -rich segment. 
A low resolution, globular conformation with .some flexible segments (par- 
ticularly the basic region) was deduced for ±3 C"-G3y-labeled. synthetic Tat- 
(1—86) (Bru) at pH 4,5, in the absence of reducing agents (38), An oxidized 
Tendamistat-Tat-(1— 37) fusion protein showed multiple conformations 
with some evidence of helicity in the Cys-rich region (20 —33) at pH 3.5 (39). 
A fusion protein consisting of the activation domain from the unrelated 
equine infectious anemia virus and Tat- (48 —57) showed high helical con- 
tent in the basic domain by NMR spectroscopy and CD (40). 

There have also been several studies of Tat fragments in complex 
with T AR RNA mainly focusing on the conformation of TA R (41—45). 
NMR spectroscopy suggested a conformational change in Tat-(32— 72) 
in the region of Gly-42 and Gly-44 upon binding to Tar (44). 3 H NMR 
was also used to show that Tat- (46 -55). acetylated at Lys-50. is bound 
in an extended conformation to the bromodomain of p300/CREB-bind- 
tng protein -associated factor, a histone acetyl transferase transcriptional 
coactivator (10). CD spectra suggested the possibility of a conforma- 
tional change in Tat- (1 — 86) upon binding to the K3.X domain of CREB- 
binding protein (17). lo N NMR relaxation measurements showed that 
Tat-(47— 58) becomes slightly more ordered on binding heparin (46), 
and CD studies of overlapping peptide fragments suggested that the 
most flexible regions are those that are adjacent to the basic region (47). 

Determining the mechanisms by which proteins attain their biologi- 
cally active three-dimensional structures is a fundamental goal of biol- 
ogy, with implications ranging from molecular recognition to under- 
standing amyloid diseases (48). Multinuclear NMR spectroscopy is 
yielding insights into the folding process by characterizing unfolded 
proteins and equilibrium intermediates. The detection of residual native 
and non native structures in strongly denatured proteins suggests that 
they are important in seeding the early steps of protein folding (49 —56). 
Molten globule states of proteins have also been investigated by NMR 
and provide insight into the nature of the folding pathway and the struc- 
ture of the transition state (57, 58). Thus, although NMR spectroscopy is 
not ideally suited to kinetic studies of protein folding, it is incomparable 
in its ability to provide detailed information about the structures and 
dynamics of unfolded and partially folded proteins (59). 

] H NMR spectra of Tat are very dif ficult to assign due to poor disper- 
sion of the resonances (36 —39). in addition, fully and partially oxidized 
material is likely to be structurally heterogeneous, including the forma- 
tion of disulfide cross-linked oligomers often observed on polyacryl- 
amide gels and adding to the difficulty in structural characteriza tion. To 
increase the dispersion of the NMR signals and permit dynamics anal - 
ysis by multinuclear NMR spectroscopy, we have prepared uniformly 



1 :3 N- and l£, N/ i3 C-iabeled Tat- (1—72) protein. The protein is unambig- 
uously reduced at pH. 4.1, and NMR analysis provides insight into the 
intracellular folding and dynamics of this intrinsically disordered pro- 
tein and the folding propensities of its five sequence domains. 

EXPERiJViE^TAL PROCEDURES 

Preparation of Hist.idine-ta.gged Tat — The detailed Tat-(1— 72) expres- 
sion and purification protocol will be published elsewhere. Briefly, an Esch- 
erichia coli codon-optimized synthetic exon 1 tat gene contained in 
pSV2tat72 was obtained through the AIDS Research arid Reference Rea- 
gent Program, Division of AIDS, NJAID, National Institutes of Health, from 
Dr. Alan Frankel (60) and cloned into pET28b(-r) (Novagen, Madison, WI), 
enabling protein expression with an N- terminal His 6 segment and T7 
epitope that adds 20 residues to the 72 -residue protein. Protein expression 
was fromi* coli BL21(DE3)plysS cells (Novagen, Madison, WI) in M9 min- 
imal medium supplemented, with either ~'^NK 4 C1, or lb NH 4 Cl andU-'~C 6 - 
D- glucose (Cambridge Isotope Laboratories Inc., Andover, MA). The 
expression protocol was modified from recently published methods (61) 
that reduce the consumption of isotopicaliy labeled ingredients. The pro- 
tein was purified by elution from a Talon j lvl (Cobalt -Superfiow) metal 
affinity resin (Clontech, Palo Alto, CA) in a gravity How column in 50 mivl 
sodium acetate buffer, 6 m guanidine HO, and 10 mM Tris(2-carboxyeth- 
yl)phosphine (Sigma) adjusted to pH 4.0 with sodium hydroxide. The 
eluted protein was pooled and serially dialyzed against degassed 0.1, 0.05, 
and 0.01 M ammonium acetate at pH 3 for ~~ 6 h each. A final dialysis was 
done against degassed water for 4 h. The dialysate was then frozen and 
freeze-dried. 

NMR Sample Preparation — Freeze-dried protein was dissolved in 
degassed buffer containing 50 mM acetate--*^,, 20 mM MES buffer, 80 }jl?a 
sodium sulfite, 0.02% sodium azide, and 5% D 2 0, pH 4.0. The resulting 
protein solution had a measured pH of 4.1, uncorrected for isotope 
effects. The sample was placed in an NMR tube that had been purged 
with argon for 15 min, and the dissolved protein was added to the 
sample tube under an argon a tmosphere. The NMR tube cap was sealed 
with T eflon tape (Dupont Dow Elastomers LLC, Wilmington, DE). The 
final protein concentration was about 1 mivl. The purity of the protein 
was confirmed by matrix-assisted laser desorption ionization mass 
spectrometry. The pH titration was done by dissolving separate protein 
samples in degassed buffers as described above, with subsequent verifi- 
cation of the pH. 

NMR Backbone Assignments — All backbone assignment experiments 
for the 92 -residue ^C/^N- labeled histi dine -tagged Ta.t-(1— 72) were 
done on a 600-MHz Varian [NOVA spectrometer equipped with a tri- 
ple resonance probe head at 20.2 "C, using standard Varian Bio Pack 
pulse sequences (62—67) (see Table Ih The NMR probe was calibrated 
with methanol (68), and all spectra were processed with NMRPipe (69). 
Spectra were apodized using a squared cosine bell function, zero-filled 
to twice the data set size, and linear predicted (forward- backward with 
eight prediction coefficients) prior to Fourier transformation. The 
dimensions of the resulting processed data sets were 4096 X 1024 for 
the J H/ J5 N HSQC experiment and 2048 X 256 X 12S for all three- 
dimensional experiments. The pulse sequences used are sensitivity -en- 
hanced (with the exception of the HNHA experiment) and use gradients 
for coherence selection and water suppression ('70). Radiation damping 
was suppressed with a water flip -back pulse (1.42 ms). lb N decoupling 
during acquisition was done using the WALYZ-16 sequence (71) with a 
7T96-kHz field strength. *H chemical shifts were referenced to the 
water signal that resonates 4.821 ppm from 2,2-dimethyl-2-siiapentane 
sulfonate at 293 K (68), b N and K 'C referencing were done indirectly 
relative to 2.2 -dimethyl- 2 -silapentane sulfonate as recommended (72). 
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TABLE 1 

Acquisition parameters for the HMR experiments 

All experiments acquired with a 0.7-s pos (.acquisition relaxation delay 



Experiment 


Transients 


Complex points 


SW[*H] 


SW["C] 


SW[ 15 N] 










ppm 




Ml/ 1S N"KSQC(62) 


32 


2048 X 256 


19 




36 


HNCACB (63) 


16 


1024 X 64 X 32 


17 


70 


30 


CBCA(CO)NH (64) 


S 


1024 X 64 X 32 


10 


70 


24 


HNCO (65) 


8 


1024 X 64 X 32 


10 


8 


24 


HN(CA)CO (66) 


S 


1024 X 64 X 32 


10 


8 


24 


HNHA (67) 


8 


1024 x 64 x 32 


10 


10 


30 



Random coil values were subtracted front the measured chemical shifts 
after correction for sequence effects as described in Ref. 73. The "Ih^™ 
measured were not corrected for relaxation effects and are likely to be 
5-10% underestimated (68). The A"J HKH „ values were calculated by 
subtracting the measured COIL values reported in (74). 

NMR Relaxation Measurements — NMR relaxation data were col- 
lected on both 15 Ndabeled and 13 C/ 15 N-labeled His-tagged Tat--(l-72) 
on a Varian INOVA. 600 MHz spectrometer at the University of Mani- 
toba and on a Varian XNOVA 800-MHz spectrometer at the University 
of Alberta (NANUC) with triple resonance probe heads at 20 °C, using 
BioPack pulse sequences (70, 75, 76). Cross -peak intensities were meas- 
ured as peak heights. Spectra were processed with NMRPipe (69), which 
was also used to fit the relaxation data to two-parameter decays. The 
errors in the relaxation rates were calculated using the signal/noise 
ratios of the individual peaks and the fits of the data to the decays. 
[Duplicate measurements were made to verify the error estimates. 

A total of nine data sets were acquired to obtain R 1 relaxation rates 
using relaxation delays of 0, 50, 100, 250, 500, 1000, 1500, 3000, and 4000 
ms. R ip measurements were made with eight data sets with spin lock 
times of 30, 60, 90, 120, 150, 180, 210, and 240 ms. The 15 N spin lock 
continuous wave field strength for the R^ 0 relaxation experiments was 
1.5 kHz, with 90° pulse lengths of 166,755 and 125,029 ms for the 600 
and 800 MHz fields respectively. The R lp measurements were corrected 
for offset from the carrier, using the measured /c, values as described in 
Ref. 76. The peaks at the outer edges of the spectra required correction 
by less than 10%. R 2 measurements were done at 600 MHz only and with 
Carr-PurcelTMeihoom-Gill times of 30, 60, 90, 120, 150, 180, 210, and 
240 ms. The R, and R lp data were acquired using four transients, 
whereas eight transients were collected for the R 2 experiments: the post- 
acquisition relaxation delay was 5 s. Data collected at 600 MHz were 
2048 X 256 complex points with SW^H] ------ 10 ppm and SW[ 15 N] ------ 24 

ppm. The steady-state J H- li: NNOE values were obtained from ratios of 
peak heights from experiments with (/ NOF ) and without (/ noNOE ) satu- 
ration of the protons for 5 s at the beginning of the experiment, The 
heteronuclear NOE values were then obtained from (/ x;OE — c^ktoev 
AloN'Oe- 1" ne spectra were acquired with 32 transients, a 5-s relaxation 
delay, and the same resolution as in the R } and R lp experiments. Water 
suppression was achieved through the use of gradients to select for the 
l3 N- 1 H. coherence (70). Data collection at 800 MHz was done exactly as 
at 600 MHz, but the resolution for the experiments was 672 X 256 
complex points with SW[ l H] = 15 ppm and SW['* 5 N] = 26 ppm. 

Relaxation Data- Analysis — The measurement of NMR relaxation 
rates provides a window on protein dynamics over a broad range of time 
scales: lb N longitudinal (RC), transverse (i? 9 ), rotating frame ,), and 
heteronuclear cross- relaxation (contained in the NOE) rates are sensi- 
tive to dynamics on the picosecond to nanosecond time scales, and R 2 
and R lp can also be sensitive to conformational exchange (R e ^) on the 
millisecond to microsecond time scales. The equations relating the 
macroscopic rates of relaxation (R y ) to the values of the spectral density 
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of motions (J) at the nuclear spin transition frequencies (to) were given 
by Ahragam (78) but can be approximated as follo ws (79, 80). 

R, -------- D(3J(<w N ) -\- 7J(0.921o) H )) + C/(w N ) (Eo.i) 

R2 = Kip = D(4J(0) + 3J(o) N ) + 13J(0.955w H )) 

4- (C/6)(4J(0) + 3J(a> N )) + R ex (Eq.2) 

NOE === (7 H /7 N )D(5J(0.87ftj H )//? 1 ) (Eq.3) 

The constants in Equations 1—3 are defined by D r -'-'- (//, ( >/47r) 2 X 
(yiiJ^ 2 ) ^' r %i^ an< i C = (A ? VijJj)/3, where jx 0 is the permeability con- 
stant of free space (4?r X 10""' kg*m*s ~ 2 *A " 2 ), y^ is the proton mag- 
netogyric ratio (2.68 X 10 b rad's -1 *!^ 1 ), y N is the magnetogyric ratio 
of J, N ( — 2.71 X 10' rad-s^-T -1 ), r>. rrl is the pro ton -nitrogen inter- 
nuclear separation (104 pm), A is the difference between the parallel 
and perpendicular components of the X "N chemical shift tensor 
( — 163 ppm), andti. is Planck's constant divided by 2 77 (1.05 X 10" 34 
T's). Note that R 2 and R ip are determined by the same combination of 
spectral density values as long as the JO N spin lock is on resonance for 
all spins (SI). An exact solution of Equations 1—3, which contain six 
unknowns, is not possible, but because the spectral density is rela- 
tively flat: a t high frequencies, the number of variables can be reduced 
to three by estimating the spectral density values near/(tu H ± as 
described by Farrow etal (82). Briefly, /(0. 87* -w^) is determined from 
Equation 3, and /(0.921oj h ) and /(0.955u> H ) are calculated directly 
from it using the assumption that J(a>) c - l/a>*. One advantage to 
measuring R lp is that, in contrast to R 2 , contributions from confor- 
mational exchange are eliminated (/? ex ^ 0) as long as the nitrogen 
carrier is placed on resonance and the spin lock power is sufficiently 
high (51, 83, 84). This allows a direct calculation of /(t*> K ) and /(0) 
(strictly /(ftj» e ), the magnitude of the effective field/frequency in the 
presence of the spin lock) from the measured relaxation rates and 
steady-state NOE. Uncertainties in the spectral densities were deter- 
mined by repeating the calculations 500 times using the S.D. values 
of the NMR measurements and Monte Carlo methods to generate a 
normal distribution and are described in more detail in Ref. 85. The 
calculations were done using the program Mathematica 5.0 and its 
simulated annealing protocol (86); statistical analyses were done 
with the program JMP IN 5.1. 

Relaxation measurements were done at two fields to permit a finer 
mapping of the spectral density and more specifically to test the 
assumptions inherent in the reduced spectral density analysis. In addi- 
tion, since R ex scales with the square of the applied magnetic field, it is 
possible to determine the contribution of R e ^ to Equation 2 by meas- 
uring relaxation parameters at two fields. Thus, R eyi and/(0) values were 
calculated from the relaxation measurements at 600 and 800 MHz as 
described in Ref. 82. The R lp relaxation data were modeled by assuming 
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that the ef fect of its neighbors (/) on the correlation time of a residue (J) 
decreases exponentially as the distance from the residue increases and 
was first described in Ref. 51, 



\0-j)\ \ 

A / 



(Eq.4) 



where R 1 :^ is an intrinsic: residue relaxat ion rate, N is the length of the 
polypeptide, Vis the residue molecular volume (87). and A is the per- 
sistence length of the polypeptide in residues. 

A different solution to Equations 1—3 was proposed by Lipari and 
Szabo (88), who derived a simplified spectral density function J(u>) LS on 
the assumption that global molecular reorientation (r c ) and fast internal 
motions (tj are stochastically uncorrelated (89), 



(1) I 



S 2 t c (1 -S 2 )t\ 

H^p + T+J^p) (Eq - 5) 



where (1/r) ^ (1/t c ) 4- (1/rJ, The Lipari-Szabo "model-free' 1 spectral 
density reduces the number of unknown parameters in Equations 1—3 
to three: 5°\ the square of the generalized order parameter, which indi- 
cates the degree of spatial freedom of the internal motion; r c , the global 
rotational correlation time for molecular reorientation, and r e , the effec- 
tive internal rotational correlation time, which is related to both the 
amplitude and the rate of internal motion. The separability of internal 
and overall dynamics is questionable for a random coil polymer, but 
comparisons of the Lipari-Szabo parameters with those obtained for 
other folded and unfolded proteins can be instructive. We have imple- 
mented the data analysis approach developed by Schurr et at (90), in 
which ail three Lipari -Szabo parameters are optimized for each residue 
individually, since this is reported to provide a significantly better fit to 
the NMR data (90). The errors in the Lipari-Szabo parameters were 
determined by Monte Carlo analysis as described above for the spectral 
density analysis, except that only 100 points were calculated (85). 

RESULTS AND D5SCUSSBON 

NMR Resonance Assignments — Sequential assignments of ± H N , 
lb N, J3 C*, 1 H C \ and 13 C/ 3 resonances were done entirely with 

three-dimensional heteronuclear triple resonance experiments that 
use one- and two -bond scalar couplings to connect the atoms (68). 
The experiments take advantage of the comparatively wide chemical 



shift dispersions of 



13,- 



, and I V resonances in unfolded proteins 



(91, 92). Ail of the backbone resonances were sequentially assigned, 
except for Met-1, Met-21, Phe-52, Phe-58, Arg-77, and Pro-78. The 
Met-1 amino and Gly-2 amide protons exchange too rapidly to be 
observed (see below). Resonances from Phe-52/58 could be assigned to 
residue type only, because they are both preceded by weak Cys reso- 
nances. Arg-77 and Pro-78 are part of the difficult sequence Arg- Arg- 
Arg-Pro-Pro and could not be unambiguously sequentially assigned. Of 
the assigned Pro resonances, all but one have 13 C* 3 chemical shifts char- 
acteristic of the trans peptide. Pro-38 la C^ shifts are outside the canon- 
ical chemical shifts for the trans configuration but are nearer the trans 
than the cis configuration. As an example, parts of the KN(CA)CO and 
HNCACB experiments used for the backbone assignment of residues 
Gin 83 — Leu 89 are shown in Fig. 1. The assignments are listed in Table 1 of 
the supplemental data. 

A Tl/^N KSQC spectrum of Tat is shown in Fig. 2 and contains three 
notable features. First, the cross -peaks are separated into regions typical of 
the residue random coil positions (Gly, Ser/Thr, other backbone amides), 
and the chemical shift dispersions of J O N (20 ppm) and J H M (1.1 ppm) 
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FIGURE 1. Strip piots extracted from three-dimensaonai, a msde- detected hatero- 
nucEear ESiJViR expert me rats for backbone assignment. Inter- and intra- esiduai correla- 
tions are obtained from (a) an HN(CA)CO spectrum correlating H N (/) and N£/) with C £/) 
and C'{i — 1) resonances and (b) an HNCACB spectrum correlating H N (/) and N(;) with 
C"(0r C"£/ — 1), Cr(i}, and C B (j - '!) resonances. A segment of the histidine-tagged Tat- 
(1-72) at pH AA and 293 K is shown depicting connectivity between residues 83 and 89. 
Correlations in a are shown with long dashed lines: in 6, the C* correlations are connected 
with solid lines, and correlations of C p are connected with short dashed lines. 



resonances are virtually identical to those of strongly denatured ubiquitin 
(93). Second, the peaks exhibit a range of intensities, with nearly ail of the 
weak and medium intensity cross-peaks falling in the sequence between 
Cys- 47 and Leu-63. This suggests that this region of the protein is under- 
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HGURE 2.Two~dlmensfo^al : H/ 1S ^ HSQC spectrum of Tat at pH 4„ 1 and 293 Lg, backbone amide proton resonances are narrowly 
inset of a shows the three peaks associated with the side chain of the single Trp residue, b, the dashed region of a is expanded to allow 
spectrum. Backbone resonance assignments of 80 of the 84 n on proline resonances are shown. Side chain Asn and Gin NH 2 resonances are 
Arg resonances are outlined in a dashed ellipse. 



over the range of 7 A - 8.9 ppm. The 
of the crowded centra! region of the 
rh a solid ellipse, and side c 



going conformational exchange on the millesecond to microsecond time 
scale, indicating transient structure formation. 

Third, the spectrum contains several more mostly weak cross- 
peaks than can he accounted for by the backbone and side chain 
atoms of a 92-residue protein. Sixteen of the additional resonances 
have been sequentially assigned (indicated with the designation J< a' f 
in supplemental Table IS), some were assignable to amino acid type, 
and some could not he unambiguously assigned. Eleven of the 
assigned minor resonances fall in the sequence between Cys-45 and 
Arg -69- One example of the multiplicity of cross-peaks is shown for 
the single Trp at position 31 (see Fig. 2a, inset), which exhibits one 
strong and two weaker side chain indole amine cross-peaks. The Trp 
is preceded by a Pro, so one of the two minor peaks could arise from 
the ci-s-peptide bond isomer. Another possible explanation is that 
some of the minor peaks are due to the presence of minor amounts of 
oxidized Cys residues that are either unassigned or unobserved. 
Another interesting example is Gly-64, which exhibits two amide 
cross-peaks of approximately equal intensity (Fig. 2). The nearest 
Pro to Gly-64 is separated from it by 14 residues, ruling out els- trans 
Pro isomerism as an explanation of the peak multiplicity. This sug- 
gests that some segments of the reduced, monomeric Tat protein 
exist in multiple conformations that are in slow equilibrium on the 
chemical shift time scale (milliseconds to seconds). In the case of 
Gly-64, the two resonances have comparable intensity, suggesting, 
equal populations of 2 conformers, whereas in many other cases one 
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resonance is significantly more intense than those arising from alter- 
nate conformers, suggesting one dominant conformer and minor 
alternates. Gly-64 is located between Leu and the )8 -branched He, and 
steric crowding could locally restrict the dynamics of the Giy amide. 
Since there is a variation in the intensities of the duplicate peaks across 
the sequence, this immediately suggests that the conformers populated 
arise from local interactions, as expected in an unfolded protein. 

Chemical Shifts and %^ a ----The NMR chemical shift is a sensitive 
indicator of conformation, and assignment of backbone chemical shifts 
permits an analysis of secondary structure by comparison with random 
coil values corrected for local sequence effects (73, 94). Consensus 
multinuclear ( 13 C, 13 C a , 13 C P , and 1 H iy ) chemical shift indexing (95) 
suggests that all segments of the reduced protein at pH 4.1 exist in a 
random coii conformation (data not shown). Examination of the indi- 
vidual chemical shaft indexing plots shown in Fig. 3 indicates that a 
majority of the resonances are within the random coil range and that 
rarely are there more than three consecutive resonances in the a-helix 
or /3-sheet chemical shift ranges. liowever, among ana 
1 H° i resonances, there appears to be a slight weighting of the conforma- 
tions by the a-helix, the most consistent classification being for the 
segment: around Glu-29. Unlike some other denatured proteins (50, 51, 
96), there is less evidence of a tendency to the /3-sheet conformation, 
perhaps because of a paucity of hydrophobic /3-branched amino acids in 
Tat- (1-72) (two lie and four Val). However, there is a slight suggestion 
of this in the region between Val-56 and Ile-59. The conclusions based 
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HGURE 3. Chemical shsft difference index plots of H N (a), N C (c), H" (rf), C" (e), airid 
(i;. a. difference plot of the ^hnmc* coupling constant from random coil coupling con- 
stant The random coii values for H'*, N, C, C", and H 01 have been adjusted for sequence 
dependence (correction factors for C s are unavailable). Reference sines in plots a-f cor- 
respond to thresholds where differences begin to reflect regions of possible 



on chemical shift Indexing are supported by the uncorrected ^J H >. JT .j tS! 
measurements, which are all in the range of 5.5—7.1 Hz with a mean 
value of 6.71 characteristic; of unfolded molecules (74). The results are 
shown in Fig. 3g, in which the differences of the measured values from 
random coii values are presented. They show that the entire polypeptide 
is undergoing rapid sampling of the a -helix and j3-sheet regions of Ram- 
achandran space with a slight preference for the a-helix in most 
segments. 

NMR Relaxation— -Despite the relatively poor chemical shift disper- 
sion In the NMR spectra, relaxation data could be measured for 64/34 
(60/84) non-Pro resonances at 600 MHz (800 MHz). As indicated in Fig, 
4a, the steady-state heteronuclear NOEs m easured at 600 and 800 MHz 
exhibit a relatively featureless, flattened bell -shaped variation with 
amino acid sequence, as expected for an unfolded protein (97). The 
NOEs range from —3.3 (—2.6) to —0.60 ( — 0.41) with mean values of 
-1.27 (-0.933) at 600 MHz (and 800 MHz). For comparison, an aver- 
age NOE of about —0.2 is observed for several folded proteins with 
similar lengths of polypeptide chain (98 —100). The more negative NOE 
values for Tat indicate much less restricted dynamics on the nanosec- 
ond to picosecond time scales than for the folded proteins. The ends of 
Tat exhibit the most negative values indicative of faster dynamics and 
the values gradually increase away from the C terminus, whereas the 
increase away from the N terminus is steeper. Significant deviations 
from the average values are observed for Thr-43 and Lys-6l-Ala-62. 

The R r measurements show a similar bell-shaped profile with two 
notable features (Fig, 4b). The R, values range from 0.75 s~ 1 (0.89 s~ J ) to 
1 .91 s"" 3 (1.92 s" 1 ) with mean values of 1.46 s" 1 (1.43 s"" 1 ) at 600 MHz 
(and 800 MHz), Several folded proteins of similar size show slightly 
higher average R 1 values on the order of 1.5 s" 1 at 600 and 750 MHz (99, 
101). The slower relaxation in Tat indicates a shorter rotational corre- 
lation time and faster dynamics on the nanosecond to picosecond time 
scale than for a folded protein. Similar to the NOE values, the R 1 rates 
decline near the ends of the protein more steeply at the N terminus than 
the C terminus. The lowest rates, apart from the termini, are found in 
the segment connecting the Cys-rich and basic regions, bet ween Thr-60 
and Ser-66, and suggest fast dynamics there. 

The rotating frame longitudinal relaxation rates {R ]f ) measured for 
Tat at 600 and 800 MHz are plotted in Fig. 4c. The R lp rates range from 
T5 s " 1 (1.3 s" 1 ) to 5,9 s" 1 (7.2 s _± ) with mean values of 3.26 s " 1 (3.29 
s _J ) at 600 and 800 MHz, respectively. The R 2 rates, measured at 600 
MHz (supplemental data), range from 1.6 to 7.1. s" \ with an average 
value of 3.5 s" 1 . The differences between the R 2 and the R lp rates pre- 
sumably arise from contributions to the former from slow conforma- 
tional exchange, but hydrogen exchange could also contribute to the 
higher R 2 values. In general, low R lp and R 2 measurements indicate 
unrestricted fast dynamics, whereas high values suggest restricted fast 
dynamics and possible contributions from slow conformational 
exchange (100). In folded proteins of similar length to Tat, the R 9 values 
in the absence of exchange are on the order of 8 s 1 (99). The low i^ lp 
and R 2 values and the negative NOE values measured for Tat indicate 
large amplitude fluctuations on the picosecond -nanosecond time scale 
characteristic of a random coil conformation. 

The R lp relaxation data at 600 MHz were fit to Equation 4, in which 
the influence of neighboring residues is modeled as a decaying exponen- 
tial (51). The flattened, bell-shaped solid curve shows the behavior pre- 
dicted for a random-coil polymer of uniform composition (Fig. 4d). The 



secondary structu re formation. The plot: ranges in a-f cor? espond to twoS.D. values from 
the mean value determined from the chemical shift tables in the Biologica! Magnetic 
Resonance Bank data base. 
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FiGURE A, Relaxation measurements of Tat-(1-72) protein at pH 4^ and 293 K, 
determined at 600 MHz (O) and 800 MHz (□) fseSd strengths, a, heteranuciear NOE; 
6, longitudinal relaxation, /?-, ; c, rota ti rig frame relaxation, /? , p ; c/, data at 600 MHz field 
strength (<&) p sotted aiong with the predicted behavior for a random coil polymer of 
uniform composition (solid line) and the variation in relaxation when residue contribu- 
tions are weighted by residue volume {dashed line). 

dashed line shows the variation in relaxation when residue contribu- 
tions are weighted by residue volume (8*7), Although a number of indi- 
vidual residues deviate from the volume-weighted model, overall the 
theoretical line follows the data fairly closely and is a much better fit 
than the uniform polymer model. The minima in the model correspond 
mainly to the small, flexible residues G3y ; Ala, and Ser, whereas the 
maxima are found at the positions of Trp, Arg, and Lys. In contrast to 
some other applications of this model to denatured proteins (51, 102, 
103), there are no obvious regions with large positive deviations from 
the theoretical curve, further evidence that reduced Tat- (1—72) at pH 
4.1 is predominantly a random coil. The segment from Pro -23 -Pro -38 
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FIGURE 5. Reduced spectrai density mapping of motions for His -tagged Tat-(1-72) 
at pH 4.1 and 293 K, a, at 0 MHz ( O ), calculated u^ing the measurements from two fields 
according to the method of Ref. 82; b, at 61 MHz (A) and 81 MHz (V); c, 522 MHz (O) and 
696 MHz (LVJ), based on estimation of J{Q.S7<d w ). 

contains 5 prolines, and the measured R x values for most residues in 
this region are greater than the calculated ones. This suggests that the 
prolines restrict dynamics on the millisecond to microsecond time 
scale, stiffening the backbone in this region. In the C terminus, from 
residue 60 onward, the measured values generally fall below the calcu- 
lated ones, suggesting greater flexibility hi this region of the molecule. 
One exception is the high value for Gly-64, suggesting restricted motion 
and slow exchange at this position (102). 

One final observation is that the region of the protein spanning resi- 
dues 45—60 (Cys-rich region and core) contains the fewest dynamics 
measurements. This is because the peak intensities in this region are 
low. Since the largest number of assigned minor peaks is also found in 
this segment (supplemental data), this supports the suggestion that 
some residues in this segment undergo slow conformational exchange 
and are the most likely sites of folding nuclei. 

Spectral Density Values—- The values of the spectral density of atom ic 
motions measured at 0, 61, 81, 522, and 696 MHz are plotted in Fig. 5. 
The high frequency values make a small, relatively uniform contribution 
to the relaxation across the sequence except at the N terminus, where a 
significant increase in high frequency motions is observed for the first 10 
residues at 522 MHz (Fig. 5a). This is in contrast to urea- unfolded 
apomyoglobin in which minima in /(0.87oj h ) plots correspond to max- 
ima in buried surface area in the folded protein (102), suggesting that 
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hydrophobic interactions persist even in 8 M urea. The lack of definition 
in the /(522) and / (696) plots for reduced Tat-(I— 72) suggest a lack of 
formation of any residual structure at pH 4.1. 

The spectra! density profiles at 61 and SI MHz are highly similar and do 
show some variation with sequence (Fig. 5b). The ends of the protein, res - 
idues Lys-61 to Leu-63 and Thr-43, exhibit the smallest contributions at 
niidfrequencies. Interestingly, in the acid-denatured state of apomyoglobin, 
maxima in buried surface area correlate weakly with maxima in the /(u> M ) 
plot (104), suggesting that /(tt^) is sensitive to formation of transient fo Iding 
nuclei. The smaller /(w N ) values in Tat near Lys-61 and at the termini sug- 
gest more flexibility in these regions. However, there is little evidence of 
transient folding in the data for Tat- (1- 72). 

The low frequency spectral densities cover a wider range of values but 
also contain the highest levels of error in comparison with the values 
calculated at high frequencies (Fig. 5c) . This arises from the uncertain- 
ties involved in the T ] o measurements. The most notable feature is a 
peak in slow motions centered at the His 6 affinity tag. There are also less 
well defined peaks in the proline-rich region and in the basic region. The 
Cys-rich segment contains the fe west measurements, and they are asso- 
ciated with some of the largest errors. The errors arise from the weak 




FIGURE 6. Order parameters determined from the tw©-f?e3d Sdiurr-Lipari-Szab© 
mode3~free method with # ex fO) (mean vaiue r dashed Sins). Aii errors in the order 
parameters were determined based on 100 Monte Cario simulations of the threshold 
acceptance method of the simulated annealing optimization. 
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peak intensities in this region of the protein, probably indicating the 
presence of conformational exchange in this region. 

Model-free Analysis — Lipari-Szaho model-free analysis of the relaxation 
measurements is shown in Fig. 6 and in the supplemental data. Calculations 
were done both separately for the data collected at each field (supplemental 
data) and for the two fields combined (Fig. 6} and both with and without R^ y 
terms. The following observations were made in aii analyses. In general, the 
contributions were less than 1 s 1 for most residues, the largest value 
being 2,5 s " \ The average S 2 values were 0.55 and 0.50 for calculations with 
and without R tZV , respectively. These results are very similar to analyses of 
relaxation data in other unfolded proteins, where order parameters in the 
range of 0.4 — 0.6 were determined (82, 105). Although die most ordered 
part of the protein is found in the His 6 affinity tag, some of the highest errors 
in the analysis are found here. This region also exhibited the highest R 2 
values, suggesting that accouxUing for conformational exchange or hydro- 
gen exchange, not detectable by R Xn , may improve the analysis. The r c val- 
ues show a slight hell -shaped variation with sequence (supplemental data), 
the correlation times at the ends of protein being smaller than in the center. 
Furthermore, the average r c values (2.1 and. 3.3 ns, calculated with and 
without R^ respectively) are barely a factor of 10 greater than the average r e 
values (0.19 and 0.25 ns), and in some cases the errors in the internal and 
overall correlation times overlap. Both of these observations support the 
notion that HJV-1 Tat--(_1— 72) exists in a random coil conformation in 
which there is no clear separation of internal and overall rotational corre- 
lation times (97). 

Effect of pH — The NMR samples used for the assignment and relax- 
ation analysis of Tat were stable for over 1 year. Fig, 7 shows the 
effects of pi t on the Tl/^N HSQC spectrum of Tat- (1 -72). The 
most obvious result is an overall reduction of cross-peak intensities. 
Fast hydrogen exchange with water might account for this, because 
exchange at rates greater than 10 3 s " 1 will result in the loss of signal 
intensity from chemical exchange line broadening. To determine if 
the observed losses are a result of the increasing rates of hydrogen 
exchange, the theoretical hydrogen exchange rates in unstructured 
Tat at various pH values were calculated, taking into account the 
nearest neighbor inductive and steric effects (106) (not shown). For 
unstructured Tat, the exchange rates between pH 3.3 and 5.8 are 



\ 110 




i 11? 




\ its 
































\ 134 








i 133 





o 

o 

o 

03 
CL 

CD 



O 



o 
o 

CD 



l-iGURE "/.Two-dimensional ""H/^N HSQC spectra of Tat-{1 -72) at 293 K observed at pH 33 (a), pH 4**E (fcland pH 5,8 (c). AH samples are — 1 mw and were obtained from a single 
expression/purification. Each spectrum was collected with 32 transients, 204S X 256 complex points, and sweep widths of 10 ppm in F2( 1 H) and 24 ppm in F'i C ^N). 
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predicted to increase on the order of 250-fold. The calculated 
exchange rates at pH 5.8 range from 0.91 s~ ' for Gln-92 to 3.8 X 10 ' 
s 1 for Gly-2, and it is likely that for many of the peaks, the intensity 
loss is attributable to rapid hydrogen exchange. For example, the 
histidines in the affinity tag and the Cys residues are predicted to he 
the fastest exchanging amides, and their cross-peaks are lost early in 
the pH titration. In these cases, the loss of cross -peaks from the 
spectra is thus indirect evidence that a residue is not involved in a 
stable, folded conformation. However, detailed analysis of the peaks 
shows that hydrogen exchange alone cannot explain all of the peak 
heights (e.g. Thr-40 and Gly-64 are predicted to exchange slowly, yet 
they disappear early in the titration (Fig. 7)). These results suggest 
that some cross -peaks lose intensity because of the development of 
local conformations that are in intermediate exchange on the milli- 
second to microsecond time scale as observed in the molten globules 
of two other proteins (58). 

Conclusions — Natively unfolded proteins have been classified into 
two categories, depending on whether they exist in a random coil con- 
formation or in a condensed premolten globule (32). Measurement of 
both static and dynamic multinuclear NMR parameters shows that Tat - 
(1—72) exists predominantly in a random coil conformation at pH 4.1. 
However, multinuclear NMR has also uncovered evidence for multiple 
backbone conformations mainly, but not exclusively, in the Cys-rich 
region and core. The possible origin of the minor cross-peaks includes 
cis-trans proline peptide isomerization, minor Cys oxidation, and mul- 
tiple conformers in slow equilibrium. The multiplicity of some peaks in 
the spectra, together with broadened peaks and the changes in peak 
intensity as a function of pH suggest that the Cys-rich and core regions 
form transiently stabilized structures at acidic and neutral pH. The pres- 
ent results are pertinent to Tat interactions with intracellular binding 
partners, such as cyclin IT, that are expected to encounter only reduced 
Tat in the intracellular environment. Cyclin IT probably recognizes the 
transiently stabilized structure that forms in the Cys- rich region of Tat. 
Furthermore, the affinity of Tat for the loop region of TAR is greatly 
increased by interaction with cyclin Tl , suggesting binding-induced folding 
(107), another feature of natively unfolded proteins. Multinuclear NMR is 
likely to be of value in determining the structures of complexes of Tat and its 
interaction partners. Finally, there is considerable interest in developing a 
T at vaccine (70) based on the presence of Tat antibodies in HIV-1 -infected 
individuals who are long term nonprogressors to AIDS. The antibodies 
raised against oxidized protein putatively recognize conformational 
epitopes, suggesting that at neutral pH parts of the protein exist in a stable 
conformation. The present dynamics analysis suggests that the most likely 
region to fold is the Cys- rich region, and the formation of disulfide bonds 
could stabilize local structure there. However, the high positive c: barge den- 
sity, paucity of hydrophobic residues, and present dynamics analysis sug- 
gest that the remainder of the protein is unlikely to form a stable conforma- 
tion even at neutral pH. 
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